Building a BioWordNet Using WordNet Data Structures and WordNet’s Software Infrastructure–A Failure Story

نویسندگان

  • Michael Poprat
  • Elena Beisswanger
  • Udo Hahn
چکیده

In this paper, we describe our efforts to build on WORDNET resources, using WORDNET lexical data, the data format that it comes with and WORDNET’s software infrastructure in order to generate a biomedical extension of WORDNET, the BIOWORDNET. We began our efforts on the assumption that the software resources were stable and reliable. In the course of our work, it turned out that this belief was far too optimistic. We discuss the stumbling blocks that we encountered, point out an error in the WORDNET software with implications for research based on it, and conclude that building on the legacy of WORDNET data structures and its associated software might preclude sustainable extensions that go beyond the domain of general English.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

WordNet-Inspired Terminological Resources for Bio-NLP

WordNet is currently the most widely used lexicon resource for general English language. We here argue in favor of a similar lexical resource for biomedicine, BioWordNet, to extend the virtues of WordNet to this sublanguage domain. We present a simple approach to semi-automatically build up such a resource. It crucially builds on the conversion of structured domain knowledge taken from the Open...

متن کامل

A Strategy of Mapping Polish WordNet onto Princeton WordNet

We present a strategy and the early results of the mapping of plWordNet – one of the largest such language resources in existence – onto Princeton WordNet. The fundamental structural premise of plWordNet differs from those of most other wordnets: lexical units rather than synsets are the basic building blocks. The addition of new material to plWordNet is consistently informed by semantic relati...

متن کامل

Estimating the inter-story drift in high rise buildings with the flexural and shear cantilever beam and mode-acceleration method

In this study, the seismic inter-story drift of structures is estimated by a combination of mode-acceleration equations with the modelling of high-rise buildings with flexural and shear cantilever beams. In the equation presented for calculating the inter-story drift, having less knowledge of the building is adequate and this issue is of significance in estimating the nonstructural component fo...

متن کامل

Learning WordNet-Based Classification Rules

A new classification paradigm, which automatically acquires WordNet-Based rules from a corpus, is presented. The approach is applied to developing an autonomous software agent that can recognize emotions which are expressed in natural language during an interactive human-computer environment. Such an agent could adapt to a user’s emotional state and dynamically adjust its interaction etiquette....

متن کامل

An Experiment: Using Google Translate and Semantic Mirrors to Create Synsets with Many Lexical Units

One of the fundamental building blocks of a wordnet is synonym sets or synsets, which group together similar word meanings or synonyms. These synsets can consist either one or more synonyms. This paper describes an automatic method for composing synsets with multiple synonyms by using Google Translate and Semantic Mirrors’ method. Also, we will give an overview of the results and discuss the ad...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008